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Abstract. We introduce optimal algorithms for the problems of data 
placement (DP) and page placement (PP) in networks with a constant 
number of clients each of which has limited storage availability and is- 
sues requests for data objects. The objective for both problems is to 
efficiently utilize each client's storage (deciding where to place replicas 
of objects) so that the total incurred access and installation cost over all 
clients is minimized. In the PP problem an extra constraint on the max- 
imum number of clients served by a single client must be satisfied. Our 
algorithms solve both problems optimally when all objects have uniform 
lengths. When objects lengths are non-uniform we also find the optimal 
solution, albeit a small, asymptotically tight violation of each client's 
storage size by el m ax where Imax is the maximum length of the objects 
and e some arbitrarily small positive constant. We make no assumption 
on the underlying topology of the network (metric, ultrametric etc.), 
thus obtaining the first non-trivial results for non-metric data placement 
problems. 



1 Introduction 

Peer-to-peer file sharing networks have become one of the most popular as- 
pects of everyday internet usage. Users from all around the globe interact in an 
asynchronous manner, benefiting from the availability of the desired content in 
neighboring or more distant locations. The success of such systems stems from 
the exploitation of a new resource, different from the traditional bandwidth - 
related resources, namely the distributed storage. Widespread utilization of this 
new resource is due to the fact that larger capacities have become cheaper, with 
significantly smaller data access times. Interacting users, utilize this resource 
by installing local storage, replicating popular content and then making it avail- 
able to neighboring users, thus dramatically decreasing bandwidth consumption, 
needed to access content from the origin servers at which it is available. 
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A suitable abstract model describing perfectly the aforementioned situation 
is the data placement problem (DP) [1]. Under this model, a set of clients (equiv- 
alently users or machines) with an underlying topology is considered and each 
client has a local amount of storage (cache) installed. Given the set of available 
objects and the preference that each client has for each object, the objective is 
to decide a replication scheme, also referred to as a placement of objects to local 
caches so as to minimize the total access cost among all clients and objects. The 
generalization of this model, under which each client's cache has an upper bound 
on the number of clients it can serve is known as the page placement problem 
(PP) [14]. 

It should be noted, that the term replication is used here instead of caching, 
because under the discussed model, a client cannot change the contents of its 
local storage without reinvocation of a replication algorithm. On the contrary, 
the term caching refers to the process of choosing objects to store locally so as 
to serve requests and use a replacement scheme so as to replace some of them 
for others on-the-fly according to popularity or other criteria. 

Our contributions. We descibe optimal algorithms, combining configurations 
generation and dynamic programming techniques, for the data placement and 
page placement problems, when the number of clients is constant. This is a natu- 
ral variation, interesting from both a theoretical and practical point of view ([3, 4, 
7]). Up to now, the only way to tackle these problems, was the 10-approximation 
algorithm of [2] and the 13-approximation algorithm of [6]), both designed for 
the general case and both based on rounding the solution of an appropriate 
linear program. When object lengths are uniform (or equivalently unit) our al- 
gorithm finds the optimum solution in polynomial time. When object lengths 
are non-uniform, our algorithm returns an optimum solution which violates the 
capacities of the clients' caches by a small, asymptotically tight additive factor. 
Our results, summarized in table 1, can be modified to handle various extensions 
of the basic problems such as the connected data placement problem ([2]) where 
object updates are frequent and consistency of all replicas of each object has 
to be guaranteed and the fc-median variant of DP where bounds are imposed 
on the number of maximum replicas allowed for each object. Furthermore, our 
results are applicable with uniform and non-uniform object lengths and can be 
employed independently of the underlying topology of the network, thus giving 
the first non-trivial results for non-metric DP problems. 

Related work. The study for the data placement problem over an arbitrary 
network where all inter-client distances form a metric was initiated in [1] where 
the authors proved that the problem in the case of objects of uniform length is 
MAXSNP-hard. They also devised a polynomial 20.5-approximation algorithm 
based on the rounding of the optimal solution of a suitable linear program. In 
the case of objects of non-uniform length, the authors proved that the problem 
of deciding whether an instance admits a solution is NP-complete and provided a 
polynomial 20.5-approximation algorithm that produces a solution at which the 
capacity of each client's cache exceeds its capacity in the optimum solution by 
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Table 1. The main known results on data placement problems (*non-uniform lengths 
with constant blow-up on clients and cache capacities, **non-uniform lengths with 
constant blow-up on cache capacities only). 



at most the length of the largest object. The approximation ratio for unit-sized 
objects was later improved to 10 in [16] and [2]. 

Various previous works have also considered variants of the data placement 
problem in terms of the underlying topology. In [13] the authors consider the case 
of distances in the underlying topologies that form an ultrametric, i.e. are non- 
negative, symmetric and satisfy the strong triangle inequality that is d(i,j) < 
max{d(i, k),d(k,j)} for clients i, j, k. The authors consider a simple hierarchical 
network consisting of three distances between the clients and devise a polynomial 
algorithm for the case of unit-sized objects by transforming it to a capacitated 
transportation problem [5]. For the case of general ultrametrics, an optimal 
polynomial algorithm is given in [9] based on a reduction to the min-cost flow 
problem. 

The page-placement problem is an important generalization of the data place- 
ment and was proposed and studied in [14]. In this problem, each client has an 
extra constraint on the number of other clients it can serve, apart from the con- 
straint on the capacity of its cache. In [14], the authors give 5-approximation 
algorithm for the problem which violates both client and cache capacity con- 
straints by a logarithmic factor at most. In [6], the logarithmic violation of both 
capacity constraints was improved to constant with a 13-approximation algo- 
rithm. Finally in [11] and [15] a game-theoretic aspect of the data placement 
problem is studied, where clients are considered to be selfish agents. In both 
works, algorithms are provided which stabilize clients in equilibrium placements. 

All previous results capture situations where write requests are rarely or 
never issued for the objects. In [2] the authors consider the case when write 
requests are common and formulate the connected data placement problem, in 
which it is required that all replicas of an object o are connected via a Steiner 
tree T a to a root r a , which can later be used as a multicast tree. The objective 
is the minimization of the total incurred access cost and the cost of building the 
Steiner tree. A 14-approximation algorithm for the problem is given in [2]. This 
problem is a generalization of the connected facility location problem for which 
the best known approximation ratio is 8.55 [17]. 



In section 2 we formally define the DP problem and introduce appropriate 
notation. In section 3 we present our main results for the DP problem, whereas 
in section 4 we present an algorithm for the page placement problem and also 
briefly discuss modifications for the other extensions. 

2 Problem definition 

The data placement problem we consider in this paper is identical to the one 
in [1] and is abstracted as follows 3 . There is a network Af consisting of a set 
M. of M = users (clients) and a universe O of TV = \0\ objects. In what 
follows we use the terms user and machine interchangeably. Each object o £ O 
has length l and each user j £ M has a local capacity Cj for the storage of 
objects. The distance between the users can be represented by a distance matrix 
D (not necessarily symmetric) where dij denotes the distance from j to i. The 
matrix D models the underlying topology. We do not assume any restrictions 
(e.g. metric) on the distances. Each user i requests access to a set of objects 
Ri t= O, namely its request set. For each object o in its request set, client i has 
a demand of access w io > 0. This demand can be interpreted as the frequency 
under which user i requests object o. The subset Pi of its request set, that i 
chooses to replicate locally is referred to as its 'placement. Obviously, \Pi\ < Ci 
for unit-sized objects. We assume an installation cost f° for each object o and 
each cache i. The objective is to choose placements of objects for every client 
such as the total induced access and installation costs for all objects and all 
clients is minimized. In the following, we will assume without loss of generality 
that each object oeOis requested by at least one user. 

We define a configuration c C A4 as a (non empty) subset of the M machines. 
Thus, we have 2 M — 1 distinct configurations and we denote by C the set of all 
configurations. For a configuration c G C and a user j we say that j is used 
with respect to c, denoted by j £ c, if the configuration c contains j's cache, i.e. 
machine mj <E c. It will be also convenient to introduce the following notation: 
p C j = 1 if j e c, and p c j = otherwise. For an object o, we define a c-placement 
with respect to o, as a placement of object o to the machines belonging to c. 

Introducing binary variables x oc to denote whether we choose or not the c- 
placement with respect to o, we can formulate our problem as an integer linear 
program, denoted by ILP in the sequel, in the following way: 



minimize 





x oc e {0,1} o e 0, ceC 



3 In [2] a seemingly different but essentially equivalent formulation of the problem is 
described. 



where cost oc is the total cost induced if configuration c is used for the placement 
of object o, that is cost oc = J^jeM^ 1 ~ Pcj) Wjo h d {c) + J2jeM P^j fji with 
dj(c) = minji : Pc .,=i dj'j the nearest distance at which client j can access object 
o under configuration c. The first set of constraints essentially states that the set 
of objects that each user replicates must not violate the user's cache constraint, 
while the second set states that for each object exactly one configuration should 
be chosen. In what follows we denote by OPT the optimum solution of the 
previous program. 

Note that the problem, as defined above does not always admit a feasible 
solution. In order to avoid trivial cases of infeasibility we assume in the sequel 
that J2ieM < J2oeo '° which essentially states that all clients can collectively 
store the union of the requested objects. Other works ([13,9]) assume existense 
of a distant server, that is, a user holding as a fixed placement the universe of 
objects, which essentially tackles the problem of infeasibility. For the case of 
uniform sized objects, this assumption has no effect in the problem's hardness 
since the hardness result of Baev et al. [1] also holds in this case. However, in 
the case of non-uniform sized objects, their result does not hold immediately, 
since it relies on the fact that it is sometimes not possible to find any feasible 
solution. When a distant server exists, any instance always admits a feasible 
solution. Nevertheless, their proof of non-approximability can be adapted and 
thus the following result can be obtained. Due to space limitations, we defer the 
details of the proof to the full version of this paper. 

Proposition 1. For any polynomial time computable function a(N), the data 
placement problem with non uniform object lengths and without any augmenta- 
tion in cache capacities, cannot be approximated within a factor ofa(N), unless 
P=NP. 

The problem can also be stated as a constrained shortest path problem as 
follows: we introduce a node for each binary variable x oc and two nodes s and 
t and connect them as follows: for each Oi, 1 < i < N — lwe connect the node 
that represents x 0i c with every node that represents x 0i+1 c for all c. Furthermore 
we connect node s with nodes x 0lC and node t with nodes x ONC for all c. At 
each edge (x 0iC , x 0i+lC ') we assign a weight equal to cost 0iC for 1 < i < N — 1. 
Edges (x ONC ,t) have weight cost ONC and edges (s,x 0lC ) have a weight of 0. The 
objective is to find the shortest path between nodes s and t while respecting 
cache capacity constraints on each node. These constraints are assigned to each 
node x QiC by simply summing up for each client contained in configuration c the 
current cache contents up to object Oj. This constrained shortest path problem 
can be solved using dynamic programming. It leads to the algorithm presented 
in the next section. 

3 Constant number of clients 

In this section we focus in the case where the number of clients in the network (i.e. 
users) is a constant. To the best of our knowledge these are the first results for 



this natural variation. We show that the data placement problem can be solved 
optimally in polynomial time when all objects are unit-sized. When objects have 
different sizes we are still able to solve the problem optimally, with only a small 
and asymptotically tight violation of the cache capacities. 

3.1 Uniform length objects 

Let us define an available cache vector r = (n, r^, . . . , tm), where rj denotes the 
current space size available on cache of user j, for 1 < j < M. For 1 < k < N, let 
us denote by fk{r) the cost associated with the optimal way of placing objects 
Oi, ... ,Ok on the clients' caches, assuming that the current available cache vector 
is r. For any configuration c, we denote by 5 C = . . . , 8^f) its machine-profile 
vector, with S l c = 1 if configuration c uses machine m;, and 5 % c = otherwise. We 
assume in this section that all lengths satisfy l a = 1, but the following recurrence 
holds for the general case and it will be also used in the next section. One has 

f k {r) = min (cost 0kC + f k -i(r - l a Jc)), 

c : r—l o .S c >0 

with fo(r) = for any r. Finding the optimum cost to ILP reduces to the 
computation of /jv(r) with r = (C\, Ci, ...Cm)- 

Theorem 1. The non-metric data placement problem with uniform length 
objects and a fixed number of clients can be solved optimally in polynomial time. 

Proof. By using standard techniques (see for example [8]), the above recurrence 
leads to an efficient dynamic programming algorithm to obtain the optimal cost 
and solution of ILP. The cache vectors r can take values from a set of size 
Ilj=i Q — ^max where C max is the maximum cache size. Assuming the values 
fk(f) are stored in an array and computed from k = 1 to k = N, then for each r 
the time needed to compute /fe(r) is 0(2 M ), i.e. a constant time, since at most 
2 M configurations need to be checked. The total time complexity is therefore 
0(N2 M C^ ax ). Notice that since objects are unit-sized, i.e. l a = 1, Vo G O, 
we can assume without loss of generality that for any capacity we have that 
Cj < N. If it is not the case, by changing this capacity to Cj := N, we obtain 
an equivalent instance because in the model considered, a client has no incentive 
to replicate any distinct object twice, since this would have no effect in the total 
access cost. Finally, the computation time becomes 0(N M+1 ). □ 

3.2 Non-uniform length objects 

The previous dynamic programming algorithm is in fact pseudo-polynomial, 
since the complexity 0(N M C^ ax ) depends on the maximum cache size C max . 
In the case of unit-sized objects we are able to bound C max by the total number 
of objects and thus obtain a polynomial time algorithm. In the case of objects of 
arbitrary length the bound C max < N does not hold and the algorithm remains 
pseudo-polynomial . 



Algorithm 1: DP-NU(7W, O, e) 



1 a «- {el max )/N; 

II update object lengths 

2 foreach o 6 do 

3 | *o «- L'o/«J; 

// update cache sizes 

4 foreach j £ A4 do 

s | c$«-|C,-oj; 

// use updated lengths and cache sizes with dynamic programming 

6 OPT a 4— optimum solution of ILP a ; 

7 Output OPT a ; 



In what follows, we show how to design a polynomial time algorithm in the 
case of arbitrary-sized objects. We let a — el max /N where e is an arbitrarily 
small positive constant and modify the object lengths and cache sizes appropri- 
ately. To compute a solution we use algorithm 1 where ILP Q denotes the integer 
linear program obtained from ILP by using length l' a (resp. cache C-) instead of 
l a (resp. Cj) for all objects o and clients j. 

Notice however that the cost function in ILP Q is the same as in ILP, i.e. the 
costs cost oc =^2 jeM (l- Pcj) Wjo lo dj (c) + ^2 jeM Pcj fj are calculated by using 
the initial lengths l a . We have the following lemma. 

Lemma 1. Given an a > 0, any solution x for ILP is a solution for ILP a . 



Proof. Let a: be a solution of ILP. One has, Vj £ Ai, 
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(1) 

, c are integers, 



where the first inequality comes from the fact that p C j and x 
the second inequality is a standard one, and the last inequality comes from 
J2oeo Scec ^° P c 3 Xoc — Cj since x is a feasible solution of ILP. Therefore, x 



satisfies J2oeo Scgc l 'o 

Pcj Xoc 5~ Cjj and x is a feasible solution of ILP Q . □ 



From the above lemma, we can immediately conclude that if ILP a has no 
solutions, then the same holds for ILP. However, if ILP has no feasible solutions, 
ILP Q could have feasible solutions. In the following, we assume that ILP admits 
at least one feasible solution, in order to be able to define an optimal solution 
denoted by OPT. 



Lemma 2. The algorithm DP-NU(A4,0) returns an optimal solution for ILP 
using el max blow-up in time polynomial in N and 1/e, where e is an arbitrarily 
small positive constant and l max is the length of the largest object. 



Proof. First, notice that by Lemma 1 the cost of the solution OPT a is not greater 
than the cost of the solution OPT. Furthermore, we have that 



E E \ pa x oc > E E [i - pa x oc > E E \^ 



Xoc - N 



oeo cec 
which becomes 



oeo cec 



E E l ° < a E E 

oeo cec oeo cec 

C 



< a 



_3 

a 



Pcj x oc + aN 
+ Na<Cj+Na (by using inequality (1)) 



Putting a = el max /N we get for the initial instance 

^ ^ ^ ^ to Pcj X oc ^ Cj ~\~ slm 

oeo cec 



(2) 



thus, each cache size is violated by at most el max - 

For the complexity, notice that for any user j's cache, we can assume with- 
out loss of generality that Cj < Nl max . If it is not the case, by changing the 
capacity to Cj := Nl max , we obtain an equivalent instance because in the model 
considered, a client has no incentive to replicate any distinct object twice, since 
this would have no effect in the total access cost. Wc have therefore, 



C'j = 



c, 



d Nl 

< — < 

a a 



< 



N 



Finally, we obtain C' max = maxj G x Cj < N 2 /e and by a similar analysis as 
in theorem 1, the complexity of 0(N2 M C^ ax ) becomes 0(N 2M+1 e- M ). Notice 
that if a is large enough, some lengths |jo/a;J can become equal to zero. In that 
case, the dynamic programming algorithm can be accelerated for such objects, 
since an optimal placement is to put them on each machine. □ 

Using Lemma 2 we obtain the following theorem. 

Theorem 2. The non-metric data placement problem, with non-uniform object 
lengths and a fixed number of clients, can be solved optimally in polynomial time 
using el max blow-up on the machines' capacity, where e is an arbitrarily small 
positive constant and l max is the length of the largest object. 

The elmax blow-up stated in the previous theorem is asymptotically tight. 
In order to clarify this, consider an instance with N objects and two clients Mi 
and M 2 . The lengths of the objects are li = (1 — S)/N for i = 1, . . . , TV — 1 
and In = lmax — 1/e where < e < 1 and < S < 1. The cache capacities of 
the clients are C\ — el max = 1 for Mi and C2 = 1/e for M 2 . All installation 
costs are 0. Client Mi has a demand equal to 1 for the first N — 1 objects and 



no demand for object N. Client M 2 has also a demand of 1 for the first N — 1 
objects and a demand of N for the N-th object. In the optimum solution OPT, 
Mi replicates all the (N — 1) objects and Mi replicates only object N. When 
our algorithm is employed, the lengths of objects i, I < i < N — 1 become 
l'i = [k/a\ = [l-5\ =0. The length of the JV-th object becomes l' max = [N/e\ 
while the cache sizes become C[ = [l/a\ = N and C 2 = \_N/e\ . In the optimum 
solution OPT a client Mi will again choose to replicate the (N — 1) objects it 
has demand for, but client M 2 can now choose all N objects. After restoring the 
original object lengths and capacities, the total blow-up is only due to Mi and 
is equal to (N-l)((l-5)/N). Choosing S = l/(N-l) we get 1-2/N the limit 
of which is 1 = e • - = e • l ma x, as N approaches infinity. 

4 The page placement problem 

In the page placement problem, there are bounds imposed on the number of 
clients that can connect to a specified client's cache in order to access objects. 
We denote by kj the maximum number of users that can access a given user 
j's cache. If the same user access cache j for different objects it is counted only 
once. Clearly, in this problem a client requesting an object can not always use 
the nearest machine which replicates that object to access it. 

We need to introduce some terminology and notations. Let us define an 
available load vector t = (ti, ti, . . . , £m), where tj denotes kj minus the current 
number of users connected to the cache j. Notice that the number of load vectors 
is bounded by Iljlife + 1) — + 1) M - For any configuration c, we denote as 
before by S c its machine-profile vector, i.e. for 1 < i < M, 5 Z C = 1 if configuration 
c uses machine m„ and 5 l c — otherwise. 

Given an object o and a configuration c, a c-placement is a placement such 
that a machine m receives the object o if and only if to G c. In a c-placement, 
the machines outside c need a way to access the object o they are requesting. We 
call such a way a connection pattern p with respect to the configuration c, and 
we denote by # c the set of all such possible connection patterns. Given p E <P C , 
for all j £ c and i G c, we put pij = 1 if user j access object o from user i, 
and pij = otherwise. Moreover, for all j £ c and i we have pij = 0. Notice 
that |<2> c | is bounded by |c| M -l c l < M M . Finally, we define a history pattern s 
in the following way: = 1 if machine nij has previously used machine nii to 
access an object, and Sij = otherwise. The number of history patterns is equal 
to 2 M ( M_1 )/ 2 . Given p and s we denote by s V p the updated history pattern 
taking into account the current connection pattern p. The updated pattern can 
be obtained by performing a logical or between p and s, i.e. (s V p)ij = V pij. 

For a connection pattern p £ p c and a history pattern s, we denote by 
Ap^ s = {A 1 s )f£i the vector which indicates for each machine rrii the number 
of machines which are connected to rrii for the first time. Such a vector can be 
obtained in the following way: for 1 < i < M, one has A p s — Y^jL\ Pij(^ ~ s ij)- 
Finally, we define cost D ^ p = J2i,jeM d ij w joloPij + E tec f°- 



For 1 < k < N, let us denote by fk(r,t,s) the cost associated with the 
optimal way of placing objects o\, . . . ,Ofc on the clients' caches, assuming that 
the current available cache vector, load vector and access vector are respectively 
r, t, s. One has 

fk(r,t,s)= mm min {cost 0k ^ p + fk-i(r - l Jc, * - \,s, s V p)) , 

r-l k 8 c >0 t-A PiS >0 

with fo{r, t,s) — for any r, t, s. 

Theorem 3. The non-metric page placement problem with uniform length ob- 
jects and a fixed number of clients can be solved optimally in polynomial time. 

Proof Finding the optimum cost to the problem reduces to the computation 
of /Ar(r, t, s) with r = (Ci, . . . Cm), t = (k\, . . . , ku) and s = (0, . . . 0). The 
complexity for computing fk(r,s,t) is 0(2 M M M M), and there are at most 
NC^ ax M M 2 M(M - 1 ^ 2 {r,s,t) triplets. As in section 3 (Theorem 1), we can 
assume that C max < N and obtain an overall complexity of 0(N M+1 ). □ 

For the non uniform case, the same recurrence relation holds, and using a similar 
technique and analysis as in section 3.2 (not repeated here due to space limi- 
tations) the complexity becomes 0(7V 2M+1 e~ M ) and we obtain the following 
result: 

Theorem 4. The non-metric page placement problem with non-uniform object 
lengths and a fixed number of clients, can be solved optimally in polynomial time 
using el max blow-up on the machines' capacity, where e is an arbitrarily small 
positive constant and l max is the length of the largest object. 

5 Concluding Remarks 

In this paper, we addressed the problem of replicating data over a constant 
number of network clients and designed optimal algorithms via utilization of the 
notion of configurations. If all data objects are equal in size, our algorithm finds 
in polynomial time the optimum solution. When lengths of objects differ, a small 
violation of each client's cache capacity constraint is enough, so as to be able to 
find the optimum solution. 

Our technique constitutes a general framework that can also be used for 
solving optimally various common extensions of the problem such as: (a) the k- 
median variant in which an upper bound k a is imposed on the number of copies 
of each object o that can be replicated in the network and (b) the connected 
data placement problem [2], where apart from placing objects, all clients hold- 
ing replicas of the same object should also be innterconnected via a directed 
Stciner tree. Furthermore our technique can be applied for other variants of 
data placement for example the fault tolerant data placement (derived from the 
fault-tolerant facility location problem [18]) where each client can be served by 
a given number of machines and the cost is obtained by summing the costs of 



access with respect to those machines. We defer the details for these and other 
extensions, due to space limitations, for the full version of this paper. 

The proposed algorithms remain polynomial independently of any metric. 
An important aspect of further research is the modification of the described 
algorithm so as to be able to handle extensions involving payments. In such 
extensions, apart from object preferences, a client also has a budget to spend 
and pay other clients to convince them to replicate certain objects. 
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